Heuristic Adaptation of Rate-Based Process Models

نویسندگان

  • Adam Arvay
  • Pat Langley
  • A. ARVAY
  • P. LANGLEY
چکیده

Scientific models are seldom constructed from scratch; more often they are adapted from existing accounts. In this paper, we present a computational approach to this adaptation task in the context of quantitative process models. We review the paradigm of inductive process modeling and discuss RPM, a recently developed program that operates on this task. After this, we describe APM, a new system that adapts a differential equation model to a new setting by revising its parameters or altering its component processes. Next we report experiments that demonstrate APM’s basic abilities and compare its efficiency relative to its predecessor. We conclude by discussing other research on model revision and outlining plans for additional work. 1. Background and Motivation Research on computational scientific discovery (Shrager & Langley, 1990; Džeroski et al., 2007) addresses the construction of laws and models in established scientific formalisms. Much work on this topic has dealt with finding empirical relations that describe regularities in data, such as those appearing early in the stages of a field’s development. There has been much less research on the construction of explanatory models that move beyond the data to account for observations at a deeper level. Work in this area incorporates structured representations and multi-step reasoning over these structures, making the task especially relevant to the cognitive systems community. In this paper, we focus on the problem of inductive process modeling (Langley et al., 2002). Here one is provided with multivariate time series for some dynamic system and background knowledge about the types of processes that can occur in the domain. The goal is to generate a quantitative process model, including numerical parameters, that reproduces the observed trajectories and that predicts new values accurately. Such a model compiles into a set of linked differential equations, but it also accounts for the data in terms of unobserved processes. This distinguishes research in the area from work on differential equation discovery, which describes but does not explain the observations in deeper terms. However, in many cases, scientists are less concerned with creating a model from the ground up than with adapting an existing model to a new setting. This may occur when they believe the system under study has changed so that the model no longer fits observations as well as those for which they constructed it. Or they may have developed the model to explain data from one area and find that it does not fare as well on data for an adjacent area. In either case, they may need to revise the c © 2015 Cognitive Systems Foundation. All rights reserved. Proceedings of the Third Annual Conference on Advances in Cognitive Systems ACS-2015 (Article 22) A. ARVAY AND P. LANGLEY model’s parameters to address quantitative changes, or they may need to alter the model’s structure by removing, adding, or replacing some of its component processes. In the sections that follow, we describe one approach to the task of adapting a process model to explain data in a novel setting. We start by reviewing RPM, a recently developed system for process model induction that is both more reliable and more efficient than earlier approaches. After this, we describe APM, a new system for model adaptation that builds on the ideas that underlie RPM’s successes. Next we report empirical studies designed to show that APM operates as intended and that it offers efficiency gains over inducing a model from scratch. Finally, we discuss previous work on model revision and propose directions for future research. 2. Review of the RPM System In recent research, we have reported a promising approach to inductive process modeling and its implementation in the RPM system (Langley & Arvay, 2015). The framework builds on earlier ones but also introduces some important ideas about representation and processing. In this section, we review these two aspects of RPM in turn, along with some experimental results. 2.1 Representation in RPM Like its predecessors, RPM organizes differential equation models into distinct processes. These identify aspects of the equations that must stand or fall together. For example, ecosystem models include processes such as predation, grazing, growth, loss, and nutrient absorption. However, the system differs from earlier ones by making four key assumptions: • All processes concern changes over time and effect these changes at a specific rate. For instance, a chemical reaction describes interactions among a set of substances, but its rate of operation can vary over time. • Each process has one or more associated derivatives that are proportional to its rate. Some variables are inputs to a process, which it consumes and thus have negative coefficients, while others are outputs, which a process produces and thus have positive coefficients. • The rate of each process is determined by a parameter-free algebraic expression. RPM assumes that rates are always positive and inherently unobservable, so it can adopt any measurement scale it likes, avoiding the need for coefficients. • If a variable appears in the rate expression for a process, then it must also appear as a derivative associated with that process. Along with the standard supposition that the effects of different processes are additive, these postulates mean that one can compile a process model into a set of differential equations that are linear combinations of algebraic rate expressions. When joined with a fifth assumption, that all variables are observed on each time step, this suggests a novel approach to inducing process models that it both efficient and robust. Table 1 presents a simple predator-prey model that illustrates these ideas. Each process has an associated rate expression, one specifying that the rate equals the product of two variables and the others stating that it equals a single variable. Each process also has one or more associated deriva-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

A New Optimized Hybrid Model Based On COCOMO to Increase the Accuracy of Software Cost Estimation

The literature review shows software development projects often neither meet time deadlines, nor run within the allocated budgets. One common reason can be the inaccurate cost estimation process, although several approaches have been proposed in this field. Recent research studies suggest that in order to increase the accuracy of this process, estimation models have to be revised. The Construct...

متن کامل

CREDIBILITY-BASED FUZZY PROGRAMMING MODELS TO SOLVE THE BUDGET-CONSTRAINED FLEXIBLE FLOW LINE PROBLEM

This paper addresses a new version of the exible ow line prob- lem, i.e., the budget constrained one, in order to determine the required num- ber of processors at each station along with the selection of the most eco- nomical process routes for products. Since a number of parameters, such as due dates, the amount of available budgets and the cost of opting particular routes, are imprecise (fuzz...

متن کامل

Feature Selection in Structural Health Monitoring Big Data Using a Meta-Heuristic Optimization Algorithm

This paper focuses on the processing of structural health monitoring (SHM) big data. Extracted features of a  structure are reduced using an optimization algorithm to find a minimal subset of salient features by removing noisy, irrelevant and redundant data. The PSO-Harmony algorithm is introduced for feature selection to enhance the capability of the proposed method for processing the  measure...

متن کامل

DESIGN AND APPLICATION OF A HYBRID META-HEURISTIC OPTIMIZATION ALGORITHM BASED ON THE COMBINATION OF PSO, GSA, GWO AND CELLULAR AUTOMATION

Presently, the introduction of intelligent models to optimize structural problems has become an important issue in civil engineering and almost all other fields of engineering. Optimization models in artificial intelligence have enabled us to provide powerful and practical solutions to structural optimization problems. In this study, a novel method for optimizing structures as well as solving s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015